CDS

Accession Number TCMCG004C50926
gbkey CDS
Protein Id XP_025630135.1
Location join(142117101..142118432,142119025..142119198)
Gene LOC112723111
GeneID 112723111
Organism Arachis hypogaea

Protein

Length 501aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA476953
db_source XM_025774350.2
Definition spidroin-1 [Arachis hypogaea]

EGGNOG-MAPPER Annotation

COG_category S
Description cellular component assembly
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE -
KEGG_ko -
EC -
KEGG_Pathway -
GOs GO:0000226        [VIEW IN EMBL-EBI]
GO:0001578        [VIEW IN EMBL-EBI]
GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005737        [VIEW IN EMBL-EBI]
GO:0005829        [VIEW IN EMBL-EBI]
GO:0006996        [VIEW IN EMBL-EBI]
GO:0007010        [VIEW IN EMBL-EBI]
GO:0007017        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0009987        [VIEW IN EMBL-EBI]
GO:0016043        [VIEW IN EMBL-EBI]
GO:0022607        [VIEW IN EMBL-EBI]
GO:0030030        [VIEW IN EMBL-EBI]
GO:0030031        [VIEW IN EMBL-EBI]
GO:0034622        [VIEW IN EMBL-EBI]
GO:0035082        [VIEW IN EMBL-EBI]
GO:0043933        [VIEW IN EMBL-EBI]
GO:0044085        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044444        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0044782        [VIEW IN EMBL-EBI]
GO:0060271        [VIEW IN EMBL-EBI]
GO:0065003        [VIEW IN EMBL-EBI]
GO:0070286        [VIEW IN EMBL-EBI]
GO:0070925        [VIEW IN EMBL-EBI]
GO:0071840        [VIEW IN EMBL-EBI]
GO:0120031        [VIEW IN EMBL-EBI]
GO:0120036        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGAATTCCAACCAAAACAAATCATCCCCTCTTTCACTCAACAACTACAATTTCGATTTCGATCTCGGCATCGGATCCAATCGCCCCAAATCCCTCAACGACCAGAAAACCCCCAATTCCTCCGCCCCTTCTTACTCCTCCTATTCTTATTCCTCCACCGCCACTTCCTCTTCCCAACCGAGGCCCTCCTGGCAACCCAACAAACCTGCATGGTCCCACAAGCCCGCCTCCGCTCCTGCCACTCAAACTGGGTTGCCCGGTGGTCCCCCTTCCATGGTCGGTGACATCTTCGGCAAGTCATGGGGTTCCACCCAACCTTCCGCTTCTGCCTCCGCCTCCGCCTCCACCGTCGGCATAGCTAACAAAAACCCTAACCTTTTCGGAGACCTGGTCACCTCTGCGCTTGGCCAAGGTCCCAGGAGCTCTTCCAATGTTCCTCTCAAAAACGCAACCCCTGCTTCCAAGCCTTCAGCTCCCGCCACTTCCACCTTTTCCATGGGAAAAATGGAAGATTCTTTGCCCAAAACTGCTAACACCGCACAGAGTAGTACAAATTGGGGATCTTCTGCCAATTTAGGGGGTTCTAGTACCGGGTATGGTGGTACCAGTATGAATTCCAACAAAAGCCCGAATCTTGGGGGTCCTTCATTAAGCACTATGGGTGTTGGTGGTTCAGCTGGTAGTGGATTCAGTTCCAGTAACAATGATCCGTTTAGTTCTCTGTCTGGTATTGGATCAAAGCAATCTGGTGCTGCCAGTCTTAATTCAGCTGCTAAATCCCAAAAGAATGATTTGGGGGATGATGGTTTTGGAGATTTTCAGAATGCTTACAAGCCGACCTCTGCAGCTTCTGGTAATGTTGGAATTGACATTGATTTTGGTGGATCTGCCACCTCGAATCAGACCCCAGTCCAGGGATCTGCTGGTGGTGCTGATCCAATGGACATGTTTTTCTCATCTTCCTCAGCATCTGCCGGAGGCACTGCTGCGGCTACGTCTCAAGGATTTGGAGGACAGGCATCTGCAGAAGTGGAAGATTGGGGTCTGGATTCTGAGTTTGGTGGAGGAGGGCATGATGTGGGTGGCACAACCACTGAGCTTGAAGGGCTTCCCCCACCTCCTGCCGGGGTGTCTGGTGCCGCTGCCAAAAACAAGGGGATGGACAATTACAAGCAGGGCCAGTTTGCTGATGCTATCAAGTGGCTTTCCTGGGCCGTCATCCTTCTGGAGAAAGCCGGGGATAACGCAGGCTCTGTGGAGGTTTTGTCATGCAGGGCTTCCTGTTACAAAGAAGTTGGGGAGTATAAGAAGGCAGTGGCAGATTGTACAAAGGTTCTAGAAAATGATGAGAAAAATGTGTCCGTCCTTGTACAGCGTGCTCTCTTGTATGAGAGTATGGAGAAGTACAGACTTGGTGCTGAAGACCTGAGGACTGTGCTAAAGATTGATCCTGGGAACAGAGTTGCCAGAAGTACTGTTCACCGATTGGCTAAGATGGCCGATTAG
Protein:  
MNSNQNKSSPLSLNNYNFDFDLGIGSNRPKSLNDQKTPNSSAPSYSSYSYSSTATSSSQPRPSWQPNKPAWSHKPASAPATQTGLPGGPPSMVGDIFGKSWGSTQPSASASASASTVGIANKNPNLFGDLVTSALGQGPRSSSNVPLKNATPASKPSAPATSTFSMGKMEDSLPKTANTAQSSTNWGSSANLGGSSTGYGGTSMNSNKSPNLGGPSLSTMGVGGSAGSGFSSSNNDPFSSLSGIGSKQSGAASLNSAAKSQKNDLGDDGFGDFQNAYKPTSAASGNVGIDIDFGGSATSNQTPVQGSAGGADPMDMFFSSSSASAGGTAAATSQGFGGQASAEVEDWGLDSEFGGGGHDVGGTTTELEGLPPPPAGVSGAAAKNKGMDNYKQGQFADAIKWLSWAVILLEKAGDNAGSVEVLSCRASCYKEVGEYKKAVADCTKVLENDEKNVSVLVQRALLYESMEKYRLGAEDLRTVLKIDPGNRVARSTVHRLAKMAD